Computational methods for cancer survival classification using intermediate information
نویسندگان
چکیده
We study a potentially useful methodology based on machine learning (ML) involving integration of separate biomarker classes, to improve prediction and separation of ovarian cancer survival times. We also imported intermediate survival information for separating extreme two groups. For prediction of survival phenotypes, we use four classifiers, first two existing machine learning methods (support vector machine, SVM; random forest, RF), the second a new regression-based method (REG) feature selection together with Cox proportional hazards model (FSCR), FSCR_REG, the third SVMbased classifier using FSCR data sets (FSCR_SVM). We compared these four methods using three types of cancer tissue features: i) miRNA expression, ii) mRNA expression, and iii) integrated miRNA and mRNA expression information, the latter with features selected separately from miRNAs and mRNAs profiles. The accuracies of survival classification using the combined miRNA/mRNA profiles are higher than those using miRNA or mRNA alone . The latter differences indicate sometimes strong interactions between miRNA and mRNA features which are not visible in individual analyses.
منابع مشابه
Applying Two Computational Classification Methods to Predict the Risk of Breast Cancer: A Comparative Study
Introduction: Lack of a proper method for early detection and diagnostic errors in medicine are some fundamental problems in treating cancer. Data analysis techniques may significantly help early diagnosis. The current study aimed at applying and evaluating neural networks and decision tree algorithm on breast cancer patients’ data for early cancer prediction. Methods: In the current stu...
متن کاملApplying Two Computational Classification Methods to Predict the Risk of Breast Cancer: A Comparative Study
Introduction: Lack of a proper method for early detection and diagnostic errors in medicine are some fundamental problems in treating cancer. Data analysis techniques may significantly help early diagnosis. The current study aimed at applying and evaluating neural networks and decision tree algorithm on breast cancer patients’ data for early cancer prediction. Methods: In the current stu...
متن کاملAnalyzing the Survival of Patients with Breast Cancer with a Multistate Model
Introduction: Breast cancer is the second most common cancer and one of the most common causes of death in women. Metastasis is also a challenge for breast cancer patients, affecting their survival. This study aimed to determine the survival rate and measure the factors affecting the progression of the disease from the time of diagnosis by considering metastasis as an intermediate condition usi...
متن کاملBreast Cancer Diagnosis from Perspective of Class Imbalance
Introduction: Breast cancer is the second cause of mortality among women. Early detection is the only rescue to reduce the risk of breast cancer mortality. Traditional methods cannot effectively diagnose tumor since they are based on the assumption of well-balanced dataset.. However, a hybrid method can help to alleviate the two-class imbalance problem existing in the ...
متن کاملUsing data mining techniques for predicting the survival rate of breast cancer patients: a review article
This review was conducted between December 2018 and March 2019 at Isfahan University of Medical Sciences. A review of various studies revealed what data mining techniques to predict the probability of survival, what risk factors for these predictions, what criteria for evaluating data mining techniques, and finally what data sources for it have been used to predict the surv...
متن کامل